Quark Masses and Chiral Symmetry 



Michael Creutz* 

Physics Department, Brookhaven National Laboratory, Upton, NY 11973; email: creutz@bnl.gov 

(May 10, 1995) 



Abstract 



I discuss the global structure of the strongly interacting gauge theory of 
quarks and gluons as a function of the quark masses and the CP violating 
parameter 9. I concentrate on whether a first order phase transition occurs 
at 9 = 7T. I show why this is expected when multiple flavors have a small 
degenerate mass. This transition can be removed by sufficient flavor-breaking. 
I speculate on the implications of this structure for Wilson's lattice fermions. 
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I. INTRODUCTION 



This paper concerns the mass term mipip in the standard hadronic gauge theory of quarks 
and gluons. I was originally motivated by attempts to understand chiral symmetry on the 
lattice, but realized that my general understanding of chiral symmetry was inadequate. Thus 
arose the present discussion in a continuum framework. Indeed, this might be classified as a 
chiral-Lagrangian paper, in that I use primarily symmetry arguments to explore the theory 
as a function of the quark masses. 

I will be quite cavalier about defining products of fermion fields, such as the mass term 
above. I assume that some regulator, such as the lattice, has made these quantities well 
defined. Some caution is needed with regard to issues such as the chiral anomalies, which 
arise because of the singular nature of products of fields at a single space-time point. I will, 
however, ignore other complications of the regulator, except when I explicitly conjecture on 
the role of the lattice doublers. 

One of my goals is to provide an intuitive picture for the physical meaning of the CP- 
violating parameter of the strong interactions. This term, often called the 8 term, is usually 
discussed in terms of topological excitations of the gauge fields. Here, however, I treat it 
entirely in terms of the chiral symmetries expected in the massless limit of the theory. 

Among the conclusions is that a first-order transition is expected at 8 = ir when the 
flavors have a small but degenerate mass. This transition can be removed if flavor-breaking 
is large enough. At the transition, CP is spontaneously broken. I will also make a few 
remarks on the implications for the structure of Wilson's lattice fermions. 

This is a subject with a long history, and much of what I say is buried in numerous 
previous studies. The implications of 8 to the fermion mass matrix are well known to low- 
energy chiral Lagrangian discussions [1-8]. The occurrence of a first-order phase transition 
at large 8 has been discussed in [2]. The possibility of a spontaneous breaking of CP was 
pointed out even before the significance of the parameter 8 was appreciated [3]. The relation 
of 8 to lattice Wilson fermions was elucidated some time ago by Seiler and Stamatescu [9] 
and was the subject of some recent work of my own [10]. My main new contributions are 
hopefully some added intuition to the understanding of these issues and more support for 
the structure of Wilson fermions presented in [10]. 

The sign of the fermion mass is sometimes regarded as a convention. Consider a Feynman 
diagram involving a fermion loop interacting with an arbitrary number of gauge boson lines. 
Insert a factor of 1 = (7s) 2 at one vertex, and then move one of the factors of 75 around the 
loop, anti-commuting it with each gamma matrix it encounters. This reproduces the formal 
expression for the diagram but with each factor of m replaced by — m. Thus we naively 
conclude that the physics of a gauge theory is unaltered by a change of the sign of the mass 
term. 

This conclusion, however, is not true in general. It is probably correct for ordinary 
quantum electrodynamics in four space-time dimensions, where by Furry's theorem [11] 
there are no triangle diagrams and corresponding anomalies. However, it is explicitly false 
for the massive Schwinger model of electrodynamics in two space-time dimensions [12,10]. 
Furthermore, as the remaining discussion in this paper will argue, it is almost certainly true 
that hadronic physics would change if the sign of one of the quark masses were flipped. 
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To be a bit more general, consider a change of variables 




(1) 



Since 1 = (75) 



2 , this modifies the fermion mass term to 



mipip 




where 



mi = m cos(#) 
m 2 = m sin(#). 



(3) 



The kinetic and gauge terms of the gauge-theory action are formally invariant under this 
transformation. Thus, were one to start with the more general mass term of Eq. (2), one 
might expect a physical situation independent of 6. However, because of the chiral anomaly, 
this is not true. The angle 6 represents a further non-trivial parameter of the strong in- 
teractions, beyond the fermion masses. Its non- vanishing would give rise to CP violating 
processes. As such are not observed in hadronic physics, the numerical value of 6 must be 
very small [5]. 

If Eq. (1) just represents a change of variables, how can this affect physics? The reason 
is entwined with the divergences of quantum field theory and the necessity of regulariza- 
tion. Fujikawa [13] has shown how the anomaly can be incorporated into the path integral 
formulation via the the fermionic measure, which becomes non-invariant under the above 
chiral rotation. More specifically, under a Pauli-Villars [14] approach 6 represents a relative 
75 rotation between the mass term for the fundamental particle and the mass term for the 
heavy regulator field. On the lattice with Wilson's fermion prescription [15], the doublers 
play this role of defining the relative chiral phase [9,10]. 

From this point of view, the fermion doublers of lattice gauge theory are not a nemesis, 
but rather are necessary to the physics of 6 and the chiral anomaly. While the Wilson term 
does represent an explicit breaking of chiral symmetry, it is philosophically no worse than 
the heavy auxiliary field used in the Pauli-Villars approach. 

Coleman [12] discussed the physics of this extra parameter in two dimensional electro- 
dynamics, where it represents a background electric field. In Ref. [10] I used these results to 
infer a possible expected behavior of Wilson lattice fermions in both two and four dimen- 
sions. In particular I proposed generalized phase diagrams in the space of the parameters 
mi and m 2 . In this paper I show how some of these features follow directly from chiral 
symmetries and details of the known particle spectrum. I frame the present discussion in 
the context of the continuum theory after any regulator has been removed. 

The resulting diagrams are strongly dependent on the number of fermion flavors. With 
a single species, a first-order phase transition line runs down the negative mi axis, starting 
at a non zero value for m\. This is sketched in Fig. (1). For two flavors the details depend 
on the sign of a term in the effective action, but I argue for two first-order phase transition 
lines, starting near the origin and running up and down the m 2 axis. For degenerate quarks 
these transitions meet at the chiral limit of vanishing fermion mass, while a small flavor 
breaking can separate the endpoints of these first-order lines. This is sketched in Fig. (2). 
The chiral limit is pinched between these endpoints. With Nf > 2 flavors, the argument is 
sharper, with the (mi,m 2 ) plane having Nf first order phase transition lines all pointing at 
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the origin. The conventionally normalized parameter 6 is Nf times the angle to a point in 
this plane, and these transition lines are each equivalent to 8 going through ir. 

Whenever the number of flavors is odd, there is a first-order transition running down 
the negative ra\ axis. Along this line there is a spontaneous breaking of CP, with a natural 
order parameter being (iip"f 5 ip). This possibility of a spontaneous breakdown of CP was 
noted some time ago by Dashen [3] and has reappeared at various times in the lattice 
context [16,17]. 

I begin my detailed discussion with the two flavor case. Here several simplifications make 
the physics particularly transparent. I then discuss how the one flavor result arises when 
one of these flavors is taken to a large mass. From this I conjecture an analogy with heavy 
doublers and Wilson lattice fermions. Finally, I discuss the general Nf situation. 

II. TWO FLAVORS 

I begin by defining eight fields around which the discussion revolves 

a = cipip 

vr = iapj 5 TTp , . 

rj = icipj 5 ip 
6 = ciprip. 

The fermion ip has two isospin components, for which t represents the standard Pauli ma- 
trices. The factor c is inserted to give the fields their usual dimensions. Its value is not 
particularly relevant to the qualitative discussion that follows, but one convention is take 
c = F/\(ipip)\ where F is the pion decay constant and the condensate is in the standard 
vacuum. 

Corresponding to each of these quantities is a physical spectrum. In some cases this is 
dominated by a known particle. There is the familiar triplet of pions around 140 MeV and 
the eta at 547 MeV. The others are not quite so clean, with a candidate for the isoscalar a 
being the / O (980) and for the isovector 6 being the a o (980). These detailed identifications 
are not particularly important to the following discussion. One fact I emphasize is that the 
lightest particle in the 6 channel appears to be heavier than the r\. 

Now consider an effective potential V(cr, 7r, 77, 6) constructed for these fields. There are 
various formal ways of doing this, either from a Legendre transform on the generating 
function, or more physically by asking for the state of minimum energy given a set of 
expectation values for the fields. Two comments are in order. First, the fact that we are 
working with composite fields is inessential to the construction of V . Indeed, many of us still 
have qualms about distinguishing between elementary and composite fields. Second, formally 
the effective potential must be convex. Physically, this is because of a phase separation which 
would occur if an expectation value is held in an unstable region. Multiple minima are filled 
in by a Maxwell construction. This is a rather technical but well understood point which I 
ignore for the sake of pedagogy. 

I first consider the theory with vanishing quark masses. I remind the reader that this limit 
is remarkable in that it is totally free of any adjustable parameters. In the continuum limit, 
the strong coupling constant is absorbed via the phenomenon of dimensional transmutation 
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[18], and all dimensionless quantities are determined. In the full theory with the quark 
masses turned back on, the only parameters are those masses and 6. 

For the massless theory many of the chiral symmetries become exact. Because of the 
anomaly, the transformation of Eq. (1), which mixes the a and r] fields, is not a good 
symmetry. However flavored axial rotations should be valid. For example, the rotation 

^ — > e 11 ^ 12 ^. (5) 

mixes a with 7r 3 

a — > + cos((j))a + sin(^)7r 3 
7r 3 — > — s'm((j))a + cos((j))ir 3 

This transformation also mixes rj with 63 

rj — > + cos(</>)?7 + s'm((j>)6 3 
$3 > - s'm((j))r] + cos((j>)6 3 

For the massless theory, the effective potential must be invariant under such rotations. 
In this two flavor case, the consequences can be compactly expressed by going to a vector 
notation. I define the four component objects E = (c, 7r) and A = {rj,S). Chiral symmetry 
then implies that the effective potential is a function only of invariants constructed from 
these four vectors. A complete set of invariants is {E 2 , A 2 , E • A}. 

This separation into two independent sets of fields is special to the two flavor case, but 
makes the behavior of the theory particularly transparent. When I turn to more flavors, the 
arguments must be modified, but the structure of the higher symmetries actually make the 
conclusions a bit stronger. 

I now use the experimental fact that chiral symmetry appears to be spontaneously broken. 
The minimum of the effective potential should not occur for all fields having vanishing 
expectation. We also know that parity and flavor appear to be good symmetries of the 
strong interactions, and thus the expectation value of the fields can be chosen in the a 
direction. Temporarily ignoring the fields A, I expect the potential to have the canonical 
"sombrero" shape, as would be stereotyped with the form 

V = A(E 2 -v 2 ) 2 = \( a 2 + tt 2 - v 2 ) 2 (8) 

Here v is the magnitude of the vacuum expectation value for a, and A is a coupling strength 
related to the a mass. The normalization convention mentioned below Eq. (4) would have 
v = F/2. I sketch the generic structure of the potential in Fig. (3). This gives the stan- 
dard picture wherein pions are Goldstone bosons associated with fields oscillating along the 
degenerate minima of the potential. 

Now consider the influence of the fields A on this potential. Taking only small values 
for these fields, I expand the potential about vanishing A 

F = A(E 2 -^ 2 ) 2 + aA 2 -/3(E-A) 2 + ... (9) 

Here the coefficients a and (3 are functions of E 2 . For most purposes the value of the latter 
is approximately v 2 ; so, we can think of these coefficients as constants. Note also that since 
E • A is odd under parity, the expansion in this quantity starts off quadratically. 
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(6) 



(7) 



The terms proportional to a and (3 generate masses for the r] and 8 particles. Since A 2 = 
j] 2 + 8 2 , the a term gives them equal masses. Substituting E ~ (i>, 0) gives (E • A) 2 ~ v 2 r] 2 ; 
thus, the /3 term breaks the rj-6 degeneracy. Here is where the observation that the r] is 
lighter than the 6 comes into play; I have written a minus sign in Eq. (9), thus making the 
expected sign of /3 positive. I assume this for now, but will comment later on the changes 
that would occur were /3 negative. 

This discussion of the masses is not completely rigorous. I have ignored possible differ- 
ences in the wave function renormalizations for the respective particles, and higher terms in 
the expansion of Eq. (9) could also affect the physical masses. However, all I really need for 
my later conclusions is the sign of the effective coupling between the E and the A fields. 

In conventional discussions the r] (rj' for 3 flavors) acquires its mass via topological 
excitations in the gauge fields. These effects are buried in the coefficients a and /3. I 
only assume that these are generated appropriately, and make no comments on the detailed 
mechanisms. 

Now I turn on the fermion masses. I consider small masses, and assume they appear as 
a general linear perturbation of the effective potential 

V — >F-(M 1 -E + M 2 -A)/c. (10) 

Here the four-component objects Mi >2 represent the possible mass terms. The normalization 
constant c appears in Eq. (4). The zeroth component of Mi gives a conventional mass term 
proportional to ipip, contributing equally to both flavors. The mass splitting of the up and 
down quarks appears naturally in the third component of M 2 , multiplying ipr 3 ip. The term 
m 2 of Eq. (2) lies in the zeroth component of M 2 . 

The two four vectors Mi >2 represent a total of 8 possible "mass terms." However, the 
chiral symmetries of the problem tell us that physics can only depend on invariants. For 
these I can take M\, M 2 2 , and M x ■ M 2 . That there are three parameters is reassuring; there 
are two quark masses m u and ra d as well as the CP violating parameter 8. The mapping 
between these parameterizations is non-linear, the conventional definitions giving 

M l = ( m l + m d)/^ + m um d cos(0)/2 

Ml = (m 2 + m 2 )/4-m u m £i cos(0)/2 (11) 
Mi • M 2 = m u m d sm(8)/2 

Note if one of the quark masses, say m u , vanishes, then the 8 dependence drops out. While 
this may be a possible way to remove any unwanted CP violation from the strong interac- 
tions, having a single quark mass vanish represents a fine tuning which is not obviously more 
compelling than simply tuning 8 to zero. Also, having m u = appears to be phenomeno- 
logically untenable [7,8]. 

I now give a physical picture of what the two mass terms Mi and M 2 do to the "Mexican 
hat" structure of the massless potential. For Mi this is easy; its simply tilts the sombrero. 
This is sketched in Fig. (4). The symmetry breaking is no longer spontaneous, with the 
tilt selecting the direction for E field to acquire its expectation value. This picture is well 
known, giving rise to standard relations such as the square of the pion mass being linearly 
proportional to the quark mass [19]. 

The effect of M 2 is more subtle and represents my main interest here. This quantity 
has no direct coupling to the E field; so, I must look for a higher order effect. The M 2 
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term represents a force pulling on the A field. It should give the latter an expectation value 
proportional to the strength, (A) oc M 2 . Once A gains an expectation value, it then effects 
E through the a and (3 terms of the potential in Eq. (9). The a term is a function only of E 2 , 
and, at least for small M 2 , should not qualitatively change the structure of the symmetry 
breaking. On the other hand, the (3 term will warp the shape of our sombrero. As this 
term is quadratic in E • A, this warping is quadratic in the strength of M 2 and quadratic in 
E. With (3 positive, as suggested above, this favors an expectation value of E lying along 
the vector M 2 , but the sign of this expectation is undetermined. This effect is sketched in 
Fig- (5). 

To summarize, the effect of M\ is to tilt our Mexican hat, while the effect of M 2 is 
to install a quadratic warping. The three parameters of the theory are the amount of 
tilt, the amount of warping, and, finally, the relative angle between these effects. To better 
understand the interplay of these various phenomena, I now consider three specific situations 
in more detail. 

III. CASE A: M 1 \\M 2 

First consider Mi and M 2 parallel in the four vector sense. This is the situation when we 
have the two mass terms of Eq. (2) and no explicit breaking of flavor symmetry. Specifically, 
I take Mi = (mi, 0) and M 2 = (m 2 , 0). In this case the warping and the tilting discussed in 
the last section are along the same axis. 

Suppose I consider m 2 at some non- vanishing fixed value, and study the behavior of the 
vacuum as mi is varied. The m 2 term has warped the sombrero, but if mi is large enough, 
the potential will have a unique minimum in the direction of this pull. As mi is reduced in 
magnitude, the tilt decreases, and eventually the warping generates a second local minimum 
in the opposite sigma direction. As mi passes through zero, this second minimum becomes 
the lower of the two, and a first-order phase transition must occur exactly at mi = 0. This 
situation is sketched in Fig. (6). From Eq. (11) we see that this situation represents m u = m^ 
and 8 = ir. 

As m 2 decreases, the warping decreases, reducing the barrier between the two minima. 
This makes the transition softer. A small further perturbation in, say, the 7r 3 direction, will 
tilt the sombrero a bit to the side. If the warping is small enough, the field can then roll 
around the preferred side of the hat, thus opening a gap separating the positive m 2 phase 
transition line from that at negative m 2 . In this way sufficient flavor breaking can remove 
the first-order phase transition at 8 = ir. If I start at 8 = with a mass splitting between 
the up and down quarks, an isoscalar chiral rotation to generate non-zero 8 will generate 
just such a term. 

Is the physical quark mass splitting sufficient to remove the 8 = ir transition? Given 
that this splitting is of comparable magnitude to the masses themselves, this is difficult to 
decide. Witten has argued [2], based on the largeness of the gauge group SU(3), that the 
physical up-down mass difference is indeed too large for this transition to occur. On the 
other hand, it is a rather academic question, since changing 8 is on the same footing as 
changing the quark masses, all of which have been predetermined by nature. 
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IV. CASE B: M\ _L M 2 



I now turn to a situation where Mi and M 2 are orthogonal. To be specific, take Mi = 



the sombrero downwards in the ±7r 3 direction. A large mi would overcome this warping, 
still giving a vacuum with only a having an expectation value. However, as mi decreases 
in magnitude with a fixed 8m, there eventually comes a point where the warping dominates 
the tilting. At this point we expect a new symmetry breaking to occur, with 7r 3 acquiring 
an expectation value. This is sketched in Fig. (7). As 7r 3 is a CP odd operator, this is 
a spontaneous breaking of CP. The possibility of such a spontaneous breaking of CP was 
pointed out some time ago by Dashen [3]. 

To make this into a proper two dimensional phase diagram, I add an m 3 7r 3 piece to the 
potential. This effectively twists Mi away from being exactly perpendicular to M 2 . When 
such a term is present, it adds an explicit CP breaking term and can be expected to remove 
the transition, just as an applied field removes the phase transition in the Ising model. We 
thus have a phase diagram in the (mi,m 3 ) plane with a first-order transition connecting two 
symmetrically separated points on the mi axis. This is sketched in Fig. (8). 

The endpoints of this transition line are associated with the two points where one of the 
respective quark masses vanishes. The phase transition occurs when the two flavors have 
masses of opposite sign. Simultaneously flipping the signs of both quark masses can always 
be done by a flavored chiral rotation, say about the 7r 3 axis, and thus is a good symmetry 
of the theory. 

Taking one of the flavors to infinite mass provides a convenient way to understand the 
one flavor situation. As sketched in Fig. (8), this represents looking only at the vicinity 
of one endpoint of the above transition line. In terms of the light species, this transition 
represents a spontaneous breaking of CP with a non- vanishing expectation for iip"f 5 ip. In 
the lattice context the possibility of such a phase was mentioned briefly by Smit [16], and 
extensively discussed by Aoki and Gocksch [17]. 



I now return to the case where Mi is parallel to M 2 , but consider the consequences were 
the parameter (3 to be negative. As discussed above, this is unlikely to be the case for the 
continuum theory. Nonetheless, as I mention below, there is some evidence that this may 
be the case for the strongly coupled lattice model. 

If (3 is negative, then the warping will be upward rather than downward along the direc- 
tion of M 2 . The case of the (mi,m 2 ) plane, as discussed in section (III) for Mi||M 2 , would 
at first glance seem to interchange with the picture of section (IV), for Mi _L M 2 . Thus we 
expect an intermediate phase with a spontaneous generation of an expectation for the pion 
field. However, there is one crucial difference. In section (IV) I considered an explicit break- 
ing of flavor, using the third direction for the example. In contrast, here I do not include 
any explicit flavor breaking. Thus when, say, 7r 3 acquires a vacuum expectation, it could 
just as well have been tti or ir 2 . For the pion to gain an expectation requires a spontaneous 




V. CASE C: (3 < 
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breaking of flavor symmetry. This is a continuous symmetry, and two Goldstone bosons 
appear. 

The possibility of a phase with a spontaneous flavor breaking has been extensively dis- 
cussed by Aoki and collaborators [17] in the context of Wilson lattice fermions at supercritical 
hopping. They give rather convincing arguments that such a phase indeed exists for strongly 
coupled lattice gauge fields. Similar phases have also been found for non-gauge models based 
on four fermion couplings [20]. 

Based on the prejudice that for two light flavors the physical value of the parameter 
(3 should be negative, I suspect that these flavor violating phases are lattice artifacts. At 
strong coupling, (3 may indeed be negative, but as the lattice spacing is reduced, it should 
switch to its physical sign and physics should revert to the situation of the previous sections. 
In the process the flavor breaking phase should be pinched out, much as seen in Ref. [20]. 

VI. IMPLICATIONS FOR WILSON'S LATTICE FERMIONS 

The Lagrangian for free Wilson lattice fermions is [15] 

L(K,r,M) = 

£j> K + r )^3+e. + ^("^ + r)j> 3 ) (12) 

Here j labels the sites of a four dimensional hyper-cubic lattice, \i runs over the space time 
directions, and e M is the unit vector in the /x'th direction. I have scaled out all factors of 
the lattice spacing. The parameter K is called the hopping parameter, and r controls the 
strength of the so called "Wilson term," which separates off the famous doublers. I have 
also added to the theory of Ref. [15] an unconventional m 2 type mass term to make the 
connection with my earlier discussion. 

Being quadratic and only involving nearest neighbor couplings, the spectrum is easily 
found by Fourier transformation. I omit the straightforward and well known details. Let 
me only observe that as a function of the mass parameter mi, there are several places where 
the fermion spectrum has no mass gap. At these points there are massless particles in the 
spectrum. Conventionally, a massless fermion is obtained by taking mi = 8Kr, but there are 
other places where this original particle is massive while other doublers from the naive r = 
theory become massless. At mi = — 8Kr one such species does so, at each of mi = ±4ifr 
there are four massless doublers, and at M = I find the remaining 6 of the total 16 species 
present in the naive theory. 

My conjecture is that these various species should be thought of as flavors. When the 
gauge fields are turned on, the the full chiral structure should be a natural generalization of 
the earlier discussion. Thus near mi = 8Kr I expect a first-order transition to end, much as 
is indicated in Fig. (1). This may join with numerous other transitions at the intermediate 
values of mi, all of which then finally merge to give a single first-order transition line ending 
near mi = —8Kr. The situation near and ±4ifr involves larger numbers of flavors, and 
properly requires the analysis of the next section. However, one possible way the lines could 
join up is shown in Fig. (9a). 
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For two flavors of Wilson fermions, if we look near to the singularity at 8Kr we should 
obtain a picture similar to Fig. (2). However, further away these lines can curve and even- 
tually end in the structure at the other doubling points. One possible picture is sketched in 
Fig. (9b). There may still be an Aoki phase, as discussed in the last section, appearing at 
strong coupling. But in the weak coupling limit I expect that phase to be squeezed out. 

VII. GENERAL N f 

As mentioned in the introduction, the general flavor case is somewhat more complicated 
because non-anomalous axial currents can mix the a and r] fields, making their separation 
less helpful. Given Nf flavors, I turn to a matrix notation. My basic spin-zero fields are now 

£^,6 = ^(1+75)^6. (13) 

The indices a and b run over the Nf flavors, and c is a normalization factor as introduced 
in Eq. (4). 

Taking various traces will reproduce analogs of the fields introduced earlier. The a field 

is 

a = cJ2^a^a = Tv(U + U^). (14) 

a 

The pseudo-scalar meson fields are 

vr a = 2Tr(A a ([/ - [/+)). (15) 

Here A a represents a set of Hermitian and traceless generating matrices for SU(Nf), i.e. 
extensions of the Pauli matrices for SU(2). For normalization, I choose 

Tr(A a A fc ) = 28 ah . (16) 

The other relevant fields are generalizations of the ??, now called 77', and 8 

rj' = zTr(U - C/ f ) (17) 

8 a = Tr(A a (C7 + [/+)). (18) 

The effective potential V(U) is now a function of these Nj fields. The consequences of 
chiral symmetry for the massless theory are that independent "left" and "right" SU(Nf) 
rotations on the field U do not change the potential. Thus I have 

V{U) = V{g L Ug- R 1 ) (19) 

whenever gi, and gR are matrices from SU(Nf). The theory has an SU(Nf) X SU(Nf) 
symmetry. Because of anomalies, V(U) is not expected to be invariant under general phase 
changes. Thus Eq. (19) would not be true for arbitrary elements of U(Nf) X U(Nf). In 
particular, V(U) can depend on the determinant of U . 

I now assume that the massless theory undergoes chiral symmetry breaking in the usual 
manner; so, the field U acquires a vacuum expectation value. Furthermore, I assume that 
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flavor and parity are still good symmetries; so, I am free to pick this expectation value to 
be proportional to the unit matrix. This is analogous to taking the sigma direction for the 
breaking in the two flavor case. In this "standard" vacuum I have 

{U ab ) = v6 ab (20) 

Please don't confuse the Kronecker symbol here with the scalar field 6 of before. 

Chiral symmetry indicates that this choice is not unique; indeed, I could equivalently 
have chosen 

{Uab)g = V9ab (21) 

where g is an arbitrary element of SU(Nf) and labels the respective vacuum. The set of 
degenerate vacua thus directly maps onto the group SU(Nf). For two flavors this is the the 
four dimensional sphere (S3), projected onto an circle for the earlier figures. One difference 
for higher Nf is that anomaly free rotations, such as by nontrivial elements of the group 
center, can mix the a and 77' field. This is why the separation into fields E and A is not 
generally useful. 

I now add masses to perturb this manifold of vacua. Following the earlier discussion, the 
added terms are linear in the various scalar and pseudo-scalar quark bilinears 

V(U) — > V(U) + Tr(MU + U^M^)/c. (22) 

Here the mass matrix M is an arbitrary complex Nf X Nf quantity. Chiral symmetry says 

that physics is unchanged under taking M > g^Mg^ , where gi, and ga are elements of 

SU(Nf). In this way an arbitrary mass matrix can be put into some standard form. Perhaps 
the most natural is the product of a real positive diagonal matrix, where the elements are 
the quark masses, with an overall phase factor e %e l N f . 

To study the multi-flavor generalization of the (mi,m 2 ) phase diagram, I take all quark 
masses equal and let 

M ab = 6 ab (m 1 + im 2 ). (23) 

What does such a distortion do to the manifold of ground states? The space of possible 
vacua is multi-dimensional, but by restricting the mass to this simple form, the important 
coordinates are the real and the imaginary parts of the trace of U . To help visualize the 
relevant structure, I generated 10,000 pseudo-random SU(3) matrices with the invariant 
Haar measure, and plot in Fig. (10) the imaginary versus the real part of their traces. I 
also plot the boundary of the allowed region, mapped out as t increases by matrices of the 
form exp(2tA 8 ), where A 8 is chosen to be proportional to the diagonal matrix with elements 
{1,1,-2}. 

This figure dramatically shows the special role played by the center elements; these are 
of the form 

g a b = Sab exp(2-7r in/ N f ) (24) 

where n is an integer. Fig. (11) contains a similar plot for S'C/(4). This complex structure of 
the stationary points of the trace of an SU(N) matrix is probably connected with the phase 
transitions seen at finite coupling for lattice gauge theory with N larger than four [21]. 
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The role of the group center can be understood more formally. The energy of the physical 
vacuum should be stationary under small variations of the expectation value of U . Making 
a variation in the group direction says that the vacuum must satisfy 



= (Tr(MU\ a - X a U^M^)) . 



(25) 



Note that for the case of Eq. (23), where M is proportional to the unit matrix, this equation 
is satisfied whenever (U) is itself proportional to the unit matrix. This is because the lambda 
matrices are traceless. 

I assume that the perturbation is sufficiently small that the low energy states are still 
cleanly mapped onto the group SU(Nf). Those elements of SU(Nf) which are proportional 
to the unit matrix form the center of the group. Thus whenever U is a constant times one 
of these elements, we have a candidate for the vacuum in the presence of the above mass 
term. While there are other stationary states in the group manifold, they are saddle points, 
and the true vacuum satisfies 



The mass addition can re scale the quantity v from its value in the symmetry limit. 

The diagonal mass term of Eq. (23) may be thought of as tilting the energy of the matrices 
in the (ReC/, ImU) plane. Depending on the direction of this tilt, the lowest energy state 
always lies at one of the vertices of the distorted Nf polygon illustrated in Figs. (10,11). As 
the phase of the mass matrix changes, the preferred vacuum jumps from vertex to vertex. 
As we encircle the origin, we expect Nf first-order phase transitions as we jump through 
these Nf possible vacua, eventually returning to where we started. Each of the transitions 
is physically equivalent. The parameter 8 is Nf times the angle to the corresponding point 
in the (mi,m 2 ) plane; so, these transitions each correspond to 8 going through ir. 

Note how for Nf > 2 the group itself contributes to warping the manifold of possible 
vacuum states. Whereas for two flavors I needed the term proportional to /3 in the potential 
of Eq. (9), this is no longer the case with more flavors. The first-order transition at 8 = ir 
with many degenerate flavors comes naturally from the vacuum structure. 

Just as in the two flavor case, I expect flavor breaking to complicate the picture. A 
more general mass term can pull in other directions in the group manifold, and the group 
center need no longer play a special role. In particular, this can cause the transitions near 
the origin to separate. Depending on the detailed masses, the case 8 = ir need no longer 
necessarily have a phase transition. 



I have presented a physical picture of the parameter 8 in the context of an effective 
potential for spin-zero bilinears of the quark fields. I have argued for a first-order transition 
at 8 = ir when all flavors are degenerate, and shown how flavor breaking can remove this 
transition. 

The picture may need to be modified for very large masses if the global structure of the 
effective potential is sufficiently modified. For example, a large M 2 could conceivably give 
a large enough contribution to the a term in Eq. (9) to destroy the spontaneous symmetry 




(26) 



VIII. SUMMARY AND CONCLUSIONS 



12 



breaking on which the discussion is predicated. Then the first-order lines in Fig. (2) could 
end before reaching infinity. 

The parameter 6 is well known to generate CP violation [1-5]. For example, the a term of 
Eq. (9) can give r] — > 2ir decay when r] has an expectation value, as caused by the presence 
of M 2 . In the context of my discussion, the strength of this decay is undetermined, but 
Refs. [5] make estimates using the full SU(3) symmetry. They also estimate the strength 
of the resulting neutron electric dipole moment, which is experimentally much more highly 
constrained. The unnaturally small value of 8 is still unexplained. 

A number of years ago Tudron and I [22] conjectured on the interplay of the confinement 
mechanism with 6, and speculated that confinement might make 6 unobservable. Recently 
Schierholz [23] argued that keeping confinement in the continuum limit may drive the theory 
to 6 = 0. The connection with present discussion is unclear, but the symmetries seem to 
indicate no obvious problem with 6 being observable. Furthermore, the fact that the r] is 
lighter than particle candidates in the 6 channel suggests that there indeed must be the (3 
term of Eq. (9), and it is this term which is directly responsible for the physical dependence 
on 6. 

An interesting question raised in Ref. [3] is whether the spontaneous parity violation 
appearing in these models might be responsible for the parity violation seen in nature. 
Generalizing these mass terms to Higgs couplings between gauged and spectator fermions 
should make it possible to create a mirror fermion model wherein the different helicities 
have different masses. Such models have been proposed [24] to circumvent difficulties of 
putting chiral fermions on the lattice [25]. While these are not ruled out experimentally, 
the necessity of adding the mirror states seems a bit artificial. Thus it would be interesting 
to know if there are any problems in principle with taking the auxiliary fermion masses to 
infinity. This raises triviality issues that deserve further study. 

Finally, my discussion has been independent of conventional gluonic perturbation the- 
ory, and circumvents any perturbative mechanism for the 6 dependence. Indeed, this phe- 
nomenon is non-perturbative and usually discussed in terms of topological effects [1]. The 
consequences, however, are highly constrained by confinement and the symmetries of the 
theory, allowing much to be said without specifying the underlying details. 
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FIGURES 
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FIG. 1. The phase diagram for one flavor with the generalized mass term. The wavy line 
represents a first-order phase transition, along which ii/jjsi/j acquires an expectation value. The 
end point of this transition line is renormalized away from the origin towards negative mi. 
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FIG. 2. The two flavor phase diagram. First-order lines run up and down the m,2 axis. The 
second order endpoints of these lines are separated by a flavor breaking mass difference. The chiral 
limit is pinched between these endpoints. 
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FIG. 3. The "sombrero" potential when the quark masses vanish. 




FIG. 4. The effect of Mi on the effective potential. The ellipse in this and the following figures 
represents the minimum of the effective potential from Fig. (3). The dot represents where the 
vacuum settles. 




FIG. 5. The effect of M2 on the effective potential. The dots represent two places where the 
vacuum can settle. 




FIG. 6. Varying mi at fixed m,2. A first-order phase transition is expected at mi = 0. This 
corresponds to 9 = ir. The dots represent places where the vacuum can settle. 
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FIG. 7. Varying mi at fixed quark mass splitting. A second order phase transition occurs when 
the tilting is reduced sufficiently for a spontaneous expectation of tt^ to develop. The dots represent 
places where the vacuum can settle. 
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FIG. 8. The (7721,7723) phase diagram for unequal mass quarks. The wavy line represents a 
first-order phase transition ending at the second order dots. The light box on the right shows how 
the one flavor diagram of Fig. (1) is extracted. 
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FIG. 9. Possible phase diagrams for lattice gauge theory with Wilson fermions. The dashed 
lines represent first-order phase transitions and the dots represent points where massless excitations 
should exist. Parts (a) and (b) are for the one and two flavor cases, respectively. The number of 
lines joining at each of these points counts the number of doubler species becoming massless there. 
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FIG. 10. Taking 10,000 SU(3) matrices generated randomly with the invariant measure, I 
plot the real versus the imaginary parts of their traces. The boundary is obtained from matrices 
generated by Ag. 
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FIG. 11. Taking 10,000 SU(4) matrices generated randomly with the invariant measure, I 
plot the real versus the imaginary parts of their traces. The boundary is obtained from diagonal 
matrices generated by diag(l, 1, 1, —3). 
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